ABSTRACT

We use new observations of very weak CIV absorption lines associated
with high-redshift Lyα absorption systems to measure the high-redshift Lyα
line two-point correlation function (TPCF). These very weak CIV absorption
lines trace small-scale velocity structure that cannot be resolved by Lyα
absorption lines. We find that (1) high-redshift Lyα absorption systems
with N(H I) > 3 × 10^14 cm^-2 are strongly clustered in redshift, (2) previous
measurements of the Lyα line TPCF underestimated the actual clustering of
the absorbers due to unresolved blending of overlapping velocity components,
(3) the present observations are consistent with the hypothesis that clustering
of Lyα absorption systems extends to lower column densities, though perhaps with
a smaller amplitude in the correlation function, and (4) the observed clustering is
broadly compatible with that expected for galaxies at z ~ 2-3. We interpret
these results as suggesting that many or most Lyα absorbers may arise in
galaxies even at high redshifts, and, therefore, that the Lyα forest probes
processes of galaxy formation and evolution for redshifts z ≲ 5.

1. Introduction

The observational result that high-redshift Lyα absorption systems appear not to
cluster strongly in redshift (e.g. Sargent et al. 1980) has driven most discussion about
the origin of the Lyα forest. This result has generally been interpreted as evidence that
high-redshift Lyα absorbers arise in intergalactic clouds rather than in galaxies. Recent
studies of the relationship between Lyα absorbers and galaxies at low redshifts, however,
directly demonstrate that many or most low-redshift Lyα absorbers (or at least those
satisfying W_rest(Lyα) ≳ 0.3 Å) arise in galaxies rather than in intergalactic clouds (Lanzetta
et al. 1995). Why is it that high-redshift Lyα absorbers appear not to cluster strongly in
redshift whereas low-redshift Lyα absorbers appear to arise in galaxies?

One suggestion is that there exist two distinct populations of Lyα absorbers: a
rapidly evolving, unclustered, intergalactic population that dominates at high redshifts,
and a slowly evolving, clustered, galactic population that dominates at low redshifts
(e.g. Bahcall et al. 1995). Another possibility is that previous measurements of the
high-redshift Lyα two-point correlation function (TPCF) have underestimated the actual
clustering of the absorbers (presumably due to unresolved blending of overlapping velocity
components) and that Lyα absorbers arise in galaxies at all epochs.

Here we examine the second of these possibilities, that previous measurements of
the high-redshift Lyα TPCF have underestimated the actual clustering of the absorbers,
using new observations of very weak CIV absorption lines associated with high-redshift
Lyα absorbers (Cowie et al. 1995, hereafter CSKH; §2). These very weak CIV absorption
lines trace small-scale velocity structure that cannot be resolved by Lyα absorption lines
because (1) the atomic weight of C is 12 times that of H, so the thermal broadening of
CIV absorption lines is 3.5 times smaller than that of Lyα lines, and (2) CIV absorption
lines suffer far less saturation because of the difference in column densities. We show that
the CIV lines indeed help to reveal the underlying velocity correlation of the Lyα systems,
and that this same velocity structure is blended away in the Lyα data (§3). We conclude
with a comparison of the derived velocity clustering of the Lyα absorbers with that of
galaxies at the present epoch (§4).
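
The factor of 3.5 quoted above for the thermal broadening can be checked with the standard expression for a purely thermal Doppler parameter. The short derivation below is an illustrative aside rather than part of the original analysis, and it assumes that both species share the same gas temperature T:

```latex
b_{\rm th} = \sqrt{\frac{2 k_B T}{m}}
\quad\Longrightarrow\quad
\frac{b_{\rm th}(\mathrm{H\,I})}{b_{\rm th}(\mathrm{C\,IV})}
  = \sqrt{\frac{m_{\rm C}}{m_{\rm H}}}
  \simeq \sqrt{12} \approx 3.46
```

For a temperature of 2 × 10^4 K this corresponds to thermal widths of roughly 18 km s^-1 for H I and 5 km s^-1 for CIV.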

2. Data 

The observations by CSKH consist of high spectral resolution (FWHM ≈ 8 km s^-1),
high signal-to-noise ratio (S/N ~ 50 per resolution element) spectra of three QSOs obtained
with the Keck telescope and the HIRES spectrograph. The observations generally cover
both the Lyα and the corresponding CIV wavelength regions and are sensitive to CIV
absorption lines arising in CIV column densities as low as N(CIV) ≈ 10^12 cm^-2.

From the observations, CSKH selected a complete sample of 38 Lyα absorption
lines satisfying N(H I) > 3 × 10^14 cm^-2. They then eliminated seven of these absorption
lines due to contamination by unrelated metal absorption lines or lack of coverage of
the corresponding CIV wavelength region or because the lines produce corresponding
Lyman-limit absorption (which indicates N(H I) ≳ 2 × 10^17 cm^-2). The resulting sample
thus contains 31 Lyα absorption lines satisfying 3 × 10^14 cm^-2 < N(H I) < 2 × 10^17 cm^-2.
For each member of this sample, they searched the corresponding CIV wavelength region
for CIV absorption lines and applied a Voigt profile fitting procedure to the identified CIV
absorption lines to measure redshifts, Doppler parameters, and column densities.

Here we use the absorption system parameters derived by CSKH in their profile
analysis, which are summarized in their Table 1a. The average redshift of the absorbers is
⟨z⟩ = 2.6, the median column density of the absorbers is N(H I) = 8.1 × 10^14 cm^-2, and the
typical CIV/HI ratio of the absorbers is 3 × 10^-3. Of the final sample of 31 Lyα absorption
lines, 15 are observed to have associated CIV absorption, of which six show small-scale
velocity structure with between two and nine velocity components per Lyα absorption line.

3. Analysis 

3.1. High-Redshift Lyα Two-point Correlation Function

Our primary assumption is that very weak CIV absorption lines trace small-scale
velocity structure that cannot be resolved by Lyα absorption lines. Hence the goal of the
analysis is to measure the high-redshift Lyα TPCF by using very weak CIV absorption
lines instead of the Lyα absorption lines themselves.

To do this we use the results summarized in Table 1a of CSKH. In cases where CSKH
identified one or more CIV absorption lines with a single Lyα absorption line, we use the
redshifts of all CIV absorption lines in the analysis. In cases where CSKH identified no CIV
absorption lines with a single Lyα absorption line, we use the single redshift of the Lyα
absorption line in the analysis. This procedure yields a total of 52 absorption redshifts. We
then use these absorption redshifts to construct the Lyα line TPCF by normalizing the
distribution of velocity pairs with respect to an unclustered distribution of redshifts.
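
The normalization step described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code used for the analysis: the function names, the bin width, and the choice of redshifts drawn uniformly within each sightline's coverage as the "unclustered" reference are all hypothetical, and a real analysis would also account for the redshift evolution of the line density and for gaps in spectral coverage.

```python
import random

C_KMS = 299792.458  # speed of light in km/s


def velocity_separation(z1, z2):
    """Approximate velocity splitting (km/s) between two absorption redshifts."""
    return C_KMS * abs(z1 - z2) / (1.0 + 0.5 * (z1 + z2))


def pair_histogram(sightlines, bin_width, n_bins):
    """Count pairs per velocity bin, pairing only redshifts on the same sightline."""
    counts = [0] * n_bins
    for zs in sightlines:
        for i in range(len(zs)):
            for j in range(i + 1, len(zs)):
                k = int(velocity_separation(zs[i], zs[j]) // bin_width)
                if k < n_bins:
                    counts[k] += 1
    return counts


def tpcf(sightlines, z_ranges, bin_width=50.0, n_bins=20, n_random=200, seed=0):
    """Estimate xi(v) = (observed pairs) / (expected unclustered pairs) - 1.

    Assumption: the unclustered reference is built from redshifts drawn
    uniformly inside each sightline's (z_min, z_max) coverage.
    """
    rng = random.Random(seed)
    observed = pair_histogram(sightlines, bin_width, n_bins)
    expected = [0.0] * n_bins
    for _ in range(n_random):
        fake = [[rng.uniform(lo, hi) for _ in zs]
                for zs, (lo, hi) in zip(sightlines, z_ranges)]
        for k, c in enumerate(pair_histogram(fake, bin_width, n_bins)):
            expected[k] += c / n_random
    # Bins with no expected pairs are left undefined (NaN).
    return [o / e - 1.0 if e > 0 else float("nan")
            for o, e in zip(observed, expected)]
```

Calling `tpcf` with one list of redshifts per QSO and the matching redshift coverage returns one value of the correlation function per velocity bin; increasing `n_random` reduces the noise in the unclustered reference.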

The results are shown in Figure 1, which plots in the upper panel the high-redshift
Lyα line TPCF as traced by CIV absorption lines. (The error bars shown in Figure 1 are
based on a modified "bootstrap" technique that yields approximately correct results even
for correlated data. Details of this technique will be presented elsewhere.) It is clear from
Figure 1 that the high-redshift Lyα TPCF indicates very strong clustering on velocity
scales ≲ 250 km s^-1. We therefore conclude that high-redshift Lyα absorption systems
with N(H I) > 3 × 10^14 cm^-2 are strongly clustered in redshift.

3.2. Blending of Overlapping Velocity Components 

The results of §3.1 demonstrate that high-redshift Lyα absorption systems with
N(H I) > 3 × 10^14 cm^-2 are strongly clustered in redshift, whereas all previous analyses have
found that they are either unclustered (Sargent et al. 1980) or only very weakly clustered
in redshift (e.g. Webb 1987; Barcons & Webb 1991). How are these results compatible?

To examine this issue, we apply the standard method of measuring the Lyα TPCF to
models of the Lyα absorption lines observed by CSKH. We first generate a set of Lyα
absorption lines according to the results in Table 1a of CSKH. We adopt a constant CIV/HI
ratio of 3 × 10^-3, assume that the Doppler parameters are due to thermal motions,
convolve the synthetic absorption lines with the appropriate instrumental response, and
add noise to match the actual signal-to-noise ratio of the observations. Next, we fit the
resulting synthetic spectra using the Voigt profile fitting routine described previously by
Lanzetta & Bowen (1992). For each absorption line we add velocity components until the
decrease in χ² is smaller than the accompanying decrease in degrees of freedom, ν. Finally,
we construct the Lyα TPCF according to the procedures described in the previous section,
but this time using the fitted redshifts instead of the actual redshifts.
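
The stopping rule for adding velocity components can be illustrated with a small helper. This is a sketch only: the function name is hypothetical, it is not taken from the fitting routine of Lanzetta & Bowen (1992), and it assumes that each Voigt component carries three free parameters (redshift, Doppler parameter, and column density), so that each added component lowers ν by three.

```python
def choose_n_components(chi2_by_n, params_per_component=3):
    """Return the number of velocity components at which fitting stops.

    chi2_by_n[k] is the best-fit chi-square obtained with k + 1 components.
    A further component is kept only while the decrease in chi-square is at
    least as large as the decrease in degrees of freedom it costs.
    """
    n = 1
    while n < len(chi2_by_n):
        delta_chi2 = chi2_by_n[n - 1] - chi2_by_n[n]  # gain from one more component
        delta_nu = params_per_component               # assumed cost in degrees of freedom
        if delta_chi2 < delta_nu:
            break
        n += 1
    return n
```

Because the threshold is only one unit of χ² per degree of freedom, this rule accepts components of marginal significance, which makes it a generous criterion for recovering blended structure.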

The results are shown in Figure 1, which plots in the lower panel the high-redshift
Lyα TPCF as traced by Lyα absorption lines. It is clear that the Lyα absorption lines
cannot reveal the strong clustering indicated by the CIV absorption lines. The lower panel
of Figure 1 may be directly compared with the high-redshift Lyα TPCF presented by Hu
et al. (1995); both use observations of nearly the same quality, and both obtain practically
identical results. Note that our stopping criterion for velocity components, Δχ² < Δν,
purposely allows even marginally significant lines to be included. If no correlation is
obtained even with this generous criterion, it certainly will not be found with a more
conservative one. We therefore conclude that previous measurements of the high-redshift
Lyα TPCF have underestimated the actual clustering of the absorbers due to unresolved
blending of overlapping velocity components.



Fig. 1. High-redshift Lyα TPCF as traced by very weak CIV absorption lines (upper
panel) and as traced by Lyα absorption lines (lower panel).

3.3. Extension to Lower Column Densities

The results of the previous section demonstrate that previous measurements of the
high-redshift Lyα TPCF of absorbers with N(H I) > 3 × 10^14 cm^-2 have underestimated
the actual clustering of the absorbers. Can this result extend to lower column densities for
which blending is presumably weaker?

To examine this issue, we repeat the analysis described in the previous section for
Lyα lines generated in two different ways. For the first simulation we assume that the
CIV Doppler parameter b is entirely due to thermal motion, so b(HI) = √12 b(CIV), and
reduce the HI column densities by a factor of 100 with respect to the original ones. An
example of just how a Voigt profile fit to high spectral resolution, high signal-to-noise ratio
observations can underestimate the actual number of velocity components comprising an
absorption line is shown in Figure 2, in which panel (a) shows the result of synthesizing the
complex of lines at z = 2.7853 toward Q0302-003 (with HI column densities decreased by
a factor of 100 with respect to the original ones), panel (b) shows the actual components
of the Lyα absorption line, and panel (c) shows the result of the Voigt profile fitting
procedure. This complex of nine lines is adequately fitted (χ²/ν = 0.91) with only three
velocity components. The synthetic spectrum is then fitted in the same way as in §3.2. The
resulting TPCF, Figure 3a, is still weaker than that of the CIV lines, but clearly detectable.

The assumption that all the velocity dispersion is thermal leads to temperatures in
excess of 6 × 10^4 K in some cases, and this is inappropriate in most models (see Charlton
1995 for a review of the models). We therefore add a second simulation in which the
temperature is assumed to be 2 × 10^4 K, and any excess Doppler parameter is ascribed to
turbulence and applied equally to the CIV and Lyα lines. In a few cases the CIV Doppler
parameter is just below the assumed thermal value, and in these cases we simply adopt the
2 × 10^4 K thermal width for Lyα. In this simulation the HI column density is assumed to be
10 × that for CIV for each component. The Lyα lines are now generally narrower than in
the first simulation, and so the component structure is more easily detected. Consequently,
the Lyα line TPCF has larger values at low velocity separations, as can be seen from
Figure 3b. This should be compared with the observational result that little clustering is
found at these redshifts (e.g. Rauch et al. 1992).
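
The second simulation's mapping from a measured CIV Doppler parameter to a Lyα Doppler parameter can be sketched as follows. This is an illustrative reading of the recipe described above, not the original code: the function names are hypothetical, and adding the thermal and turbulent widths in quadrature is a standard convention assumed here rather than stated in the text.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
AMU = 1.66053906660e-27   # atomic mass unit, kg
M_H, M_C = 1.008, 12.011  # atomic masses of H and C in amu


def b_thermal(temperature, mass_amu):
    """Thermal Doppler parameter sqrt(2 k T / m), returned in km/s."""
    return math.sqrt(2.0 * K_B * temperature / (mass_amu * AMU)) / 1.0e3


def b_hi_from_civ(b_civ, temperature=2.0e4):
    """Lya Doppler parameter (km/s) implied by a CIV Doppler parameter (km/s).

    Any CIV width in excess of the thermal value at the assumed temperature
    is treated as turbulence and applied equally to both species
    (assumption: thermal and turbulent widths add in quadrature).
    """
    b_th_c = b_thermal(temperature, M_C)
    b_th_h = b_thermal(temperature, M_H)
    if b_civ <= b_th_c:
        # CIV line narrower than the thermal value: adopt the thermal Lya width.
        return b_th_h
    b_turb_sq = b_civ ** 2 - b_th_c ** 2
    return math.sqrt(b_th_h ** 2 + b_turb_sq)
```

Under these assumptions a CIV line with b = 10 km s^-1 maps to a Lyα width of about 20 km s^-1, compared with about 35 km s^-1 under the purely thermal scaling b(HI) = √12 b(CIV) of the first simulation, which is why the component structure is easier to recover in the second case.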

These simulations are admittedly too naive, as we do not take into account the increase
in line number density at low column densities. This increase will produce strong blending
effects among low column density lines themselves and also with the higher column density
lines. Their combined effect is very difficult to simulate, as it depends very strongly on
the higher-order correlation functions of the distribution of the lines. In addition, there
are observations that suggest that the amplitude of the clustering is smaller at these low
column densities (Hu et al. 1995). All of these effects could very well erase all the signal
in the correlation function for low column density lines, and hence we cannot conclude
anything about the behavior of these low column density lines other than that it is compatible
with their being clustered, though perhaps with a smaller clustering amplitude.

Fig. 2. Example of how a Voigt profile fit to high spectral resolution, high signal-to-noise
ratio observations can underestimate the actual number of velocity components comprising
an absorption line. Panel (a) shows the result of synthesizing the complex of lines at
z = 2.7853 toward Q0302-003 (with HI column densities decreased by a factor of 100
with respect to the original column densities), panel (b) shows the actual components that
comprise the Lyα absorption line, and panel (c) shows the result of the Voigt profile fitting
procedure.

4. Discussion and Summary 

The most significant result of the previous sections is that high-redshift Lyα
absorbers with N(H I) > 3 × 10^14 cm^-2 are strongly clustered in redshift on velocity
scales ≲ 250 km s^-1. While the effect might be due to pairs of clouds with small velocity
differences causing the observed TPCF (Miralda-Escude et al. 1995; Rauch 1995), we could
be seeing real clustering. With the observed velocity correlation length we cannot decide
whether the Lyα absorbers are independent entities, as has generally been assumed so far,
or clouds within the halos of galaxies, the possibility we are exploring here. More detailed
questions are even harder to answer, for example the type of galaxies in which the absorbers
might reside, whether we are observing multiple clouds within the same galaxies, and the
ionization state of carbon in the clouds. We can only ask whether the strength of Lyα
clustering is consistent with expectations of galaxy clustering at these early epochs.

To examine this issue, we consider a simple model for the evolution of the galaxy 
TPCF. In a first step, we ignore peculiar motions and motions of clouds within galaxies 
and assume that as a function of velocity and redshift the galaxy TPCF can be described 
by (Efstathiou et al. 1991) 

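\[
\xi(v,z) = \left[ \frac{H_0}{H(z)} \, \frac{v}{H_0 r_0} \right]^{-1.8} (1+z)^{-(3+\epsilon)} \qquad (1)
\]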

where H(z) is the Hubble constant at epoch z (we take q_0 = 0.5) and H_0 r_0 = 550 km s^-1
is the present-day galaxy correlation length. The evolutionary parameter ε takes the
value −1.2 for comoving structures, 0 for virialized clusters, and 0.8 for linearly growing
perturbations. Recent theoretical studies (Hamilton et al. 1991; Jain et al. 1995) show a
steeper dependence on redshift at intermediate stages between the linear and virialized
limits.

To avoid the divergence of this function at small values of v, we take ξ(v,z) to be
constant below a given velocity difference v_0 and equal to ξ(v_0, z). We also convolve it with
a Gaussian distribution with width σ to account for random motions. In this way we get
a set of different models defined by three parameters, ε, σ, and v_0, which we allow to vary
within the limits −2 < ε < 4, 0 < σ < 500 km s^-1, and 1 < v_0 < 80 km s^-1.
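
A numerical sketch of this three-parameter model family is given below. It is illustrative only and rests on several assumptions that are not confirmed by the text: the power law is taken in the Efstathiou et al. (1991) form ξ = [v / (H(z) r_0)]^-1.8 (1+z)^-(3+ε), H(z)/H_0 is evaluated for a matter-only model with zero cosmological constant, the correlation function is treated as symmetric in v, and the Gaussian is applied by direct summation on a uniform grid. All function names are hypothetical.

```python
import math


def hubble_ratio(z, q0=0.5):
    """H(z)/H_0 for a matter-only Friedmann model with no cosmological constant."""
    return (1.0 + z) * math.sqrt(1.0 + 2.0 * q0 * z)


def xi_power_law(v, z, eps, v0, h0r0=550.0, gamma=1.8):
    """Power-law TPCF in velocity space, held constant below the cutoff v0.

    Assumed form: [v / (H(z) r_0)]^-gamma * (1 + z)^-(3 + eps), with v in km/s.
    """
    v_eff = max(abs(v), v0)  # flat core avoids the divergence at small v
    return (v_eff / (hubble_ratio(z) * h0r0)) ** (-gamma) * (1.0 + z) ** (-(3.0 + eps))


def xi_model(v, z, eps, sigma, v0, n_steps=401):
    """Power-law TPCF convolved with a Gaussian of width sigma (km/s)."""
    if sigma <= 0.0:
        return xi_power_law(v, z, eps, v0)
    half_width = 5.0 * sigma
    du = 2.0 * half_width / (n_steps - 1)
    total = 0.0
    weight = 0.0
    for i in range(n_steps):
        u = -half_width + i * du
        w = math.exp(-0.5 * (u / sigma) ** 2)  # unnormalized Gaussian kernel
        total += w * xi_power_law(v - u, z, eps, v0)
        weight += w
    return total / weight  # normalizing by the summed weights
```

Evaluating `xi_model` on the velocity bins of the observed TPCF for a grid of (ε, σ, v_0) values gives the set of models that can then be compared with the data, for example through a χ² statistic.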

Fig. 3. The TPCF for low column density clouds as obtained with two different models:
thermal-only broadening (panel a) and thermal plus turbulence (panel b).

Predictions of all these models are then compared with the observed TPCF of
high-redshift Lyα absorbers. The best fit is achieved for ε = 2.4 and σ = 100 km s^-1.
The 1σ, 2σ, and 3σ confidence regions obtained treating v_0 as an uninteresting parameter are
plotted in Figure 4. From this calculation it is clear that if normal galaxies host the Lyα
absorbers, at an average rate of one per galaxy, their correlation function is evolving rapidly
and the combined intragalactic and intergalactic velocity dispersion is ≲ 150 km s^-1.

Note that our redshift range is relatively small, Δz ≈ 0.6, so that we cannot
separately fit r_0 and ε. Our determination of ε is therefore anchored by the general
galaxy correlation function at the present epoch. This may be inappropriate in several
respects. We overestimate the correlation function if the host galaxies of the Lyα clouds
are less clustered at the present epoch, for example if they are mostly spiral galaxies, or we
underestimate it if there are multiple Lyα absorbers in galaxies at high redshift. The best
we can deduce from our simple analysis is that the observed clustering of high-redshift
Lyα absorbers is broadly consistent with the expected clustering of galaxies.

We conclude that (1) high-redshift Lyα absorbers with N(H I) > 3 × 10^14 cm^-2 are
strongly clustered in redshift on velocity scales ≲ 250 km s^-1, (2) previous measurements
of the Lyα TPCF have underestimated the actual clustering of the absorbers due to
unresolved blending of overlapping velocity components, (3) the present observations may
be consistent with the hypothesis that clustering of Lyα absorption systems persists to
lower column densities, although the clustering amplitude is likely smaller there,
and (4) the observed TPCF is broadly compatible with that expected from galaxies at
z ~ 2-3.

We interpret these results to suggest that many or most Lyα absorbers may arise in
galaxies at all epochs, and therefore that the Lyα forest probes the processes of galaxy
formation and evolution for redshifts z ≲ 5.

grant NAGW-4433 and by a Career Development Award from the Dudley Observatory,
and AY by NASA grant NAG-51228. 

Fig. 4. Confidence limits in the parameter space formed by the clustering evolution
parameter ε and the typical velocity of galactic halo motions σ. The 1σ, 2σ, and 3σ confidence
contours are plotted.